Efficient calculation of empirical P-values for genome-wide linkage analysis through weighted permutation.

Authors

  • Sarah E Medland
  • James E Schmitt
  • Bradley T Webb
  • Po-Hsiu Kuo
  • Michael C Neale
Abstract

Linkage analysis in a multivariate or longitudinal context presents both statistical and computational challenges. The permutation test can be used to avoid some of the statistical challenges, but it substantially adds to the computational burden. Utilizing the distributional dependencies between π̂ (defined as the proportion of alleles that are identical by descent (IBD) for a pair of relatives at a given locus) and the permutation test, we report a new method of efficient permutation. In summary, the distribution of π̂ for a sample of relatives at locus x is estimated as a weighted mixture of π̂ drawn from a pool of 'representative' π̂ distributions observed at other loci. This weighting scheme is then used to sample from the distribution of the permutation tests at the representative loci to obtain an empirical P-value at locus x (which is asymptotically distributed as the permutation test at locus x). This weighted mixture approach greatly reduces the number of permutation tests required for genome-wide scanning, making it suitable for use in multivariate and other computationally intensive linkage analyses. In addition, because the distribution of π̂ is a property of the genotypic data for a given sample and is independent of the phenotypic data, the weighting scheme can be applied to any phenotype (or combination of phenotypes) collected from that sample. We demonstrate the validity of this approach through simulation.
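
The two steps summarized in the abstract (expressing the IBD-proportion distribution at locus x as a weighted mixture of representative distributions, then reusing those weights on the permutation results) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the weights are fitted, so the EM update for mixture proportions used here is an assumption, as are the binned histograms, the toy numbers, and the function names `fit_mixture_weights` and `weighted_empirical_p`. Instead of drawing random samples from the representative permutation distributions, the sketch computes the weighted exceedance proportion, which is the expected value of such sampling.

```python
# Illustrative sketch only; all names and data below are hypothetical.

def fit_mixture_weights(target_hist, rep_hists, n_iter=500):
    """Fit mixture proportions so that the weighted sum of the
    representative histograms approximates the target histogram.
    Uses EM for mixture weights with fixed components (an assumed choice)."""
    k = len(rep_hists)
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        # Probability of each bin under the current mixture.
        mix = [sum(w * h[b] for w, h in zip(weights, rep_hists))
               for b in range(len(target_hist))]
        updated = []
        for w, h in zip(weights, rep_hists):
            # Share of the target mass attributed to this component.
            resp = sum(t * h[b] / mix[b]
                       for b, t in enumerate(target_hist) if mix[b] > 0)
            updated.append(w * resp)
        total = sum(updated)
        weights = [v / total for v in updated]
    return weights


def weighted_empirical_p(observed_stat, null_stats_by_locus, weights):
    """Weighted proportion of permuted statistics at the representative
    loci that are at least as large as the observed statistic."""
    p_value = 0.0
    for w, null_stats in zip(weights, null_stats_by_locus):
        exceed = sum(1 for s in null_stats if s >= observed_stat)
        p_value += w * exceed / len(null_stats)
    return p_value


# Toy binned distributions of the IBD proportion (each sums to 1).
rep_hists = [[0.5, 0.5, 0.0],   # representative locus A
             [0.0, 0.5, 0.5]]   # representative locus B
target_hist = [0.1, 0.5, 0.4]   # locus x

# Toy permutation statistics already computed at the representative loci.
null_stats_by_locus = [[0.1, 0.5, 1.2, 2.0],
                       [0.3, 0.9, 1.5, 3.1]]

weights = fit_mixture_weights(target_hist, rep_hists)
p_at_x = weighted_empirical_p(1.4, null_stats_by_locus, weights)
print(weights, p_at_x)
```

Because the weights depend only on the genotype-derived histograms, the same `weights` could be reused with permutation statistics from any phenotype measured on the same sample, which is the reuse property the abstract describes.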

Related articles

An application of the latent p value method to assess linkage in asthma pedigrees.

OBJECTIVE The latent p value is a recently proposed empirical method for assessing evidence against a null hypothesis in a stochastic system involving latent, unobservable variables. It is particularly applicable to genome-wide genetic linkage analysis for test statistics with poorly defined analytical distributions. METHODS We describe an implementation of the latent p value method and its a...

An evaluation of the replicate pool method: quick estimation of genome-wide linkage peak p-values.

The calculation of empirical p-values for genome-wide non-parametric linkage tests continues to present significant computational challenges for many complex disease mapping studies. The gold standard approach is to use gene dropping to simulate null genome scans. Unfortunately, this approach is too computationally expensive for many data sets of interest. An alternative, more efficient method ...

An efficient resampling method for assessing genome-wide statistical significance in mapping quantitative trait Loci.

Assessing genome-wide statistical significance is an important and difficult problem in multipoint linkage analysis. Due to multiple tests on the same genome, the usual pointwise significance level based on the chi-square approximation is inappropriate. Permutation is widely used to determine genome-wide significance. Theoretical approximations are available for simple experimental crosses. In ...

Uncovering Networks from Genome-Wide Association Studies via Circular Genomic Permutation

Genome-wide association studies (GWAS) aim to detect single nucleotide polymorphisms (SNP) associated with trait variation. However, due to the large number of tests, standard analysis techniques impose highly stringent significance thresholds, leaving potentially associated SNPs undetected, and much of the trait genetic variation unexplained. Pathway- and network-based methodologies applied to...

INRICH: interval-based enrichment analysis for genome-wide association studies

SUMMARY Here we present INRICH (INterval enRICHment analysis), a pathway-based genome-wide association analysis tool that tests for enriched association signals of predefined gene-sets across independent genomic intervals. INRICH has wide applicability, fast running time and, most importantly, robustness to potential genomic biases and confounding factors. Such factors, including varying gene s...

Journal:
  • Behavior Genetics

Volume 39, Issue 1

Pages: -

Publication date: 2009